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ABSTRACT 


We  exhibit  two  studies  (one  epidemiological  and  one  clinical),  both  with 
apparently  paradoxical  findings  characterized  by  group  (index  versus  control)  similarity 
on  the  dependent  (health)  variable  (Y)  means,  a  significant  group  difference  on  the 
independent  variable  (X)  means  (index  mean  greater  than  the  control  mean)  and  a 
positive  correlation  between  Y  and  X  in  the  index  group,  causing  index  subjects  with  low 
values  of  X  to  have  a  lower  Y  mean  than  the  controls  and  index  subjects  having  high 
values  of  X  to  have  a  higher  Y  mean  than  the  controls.  This  pattern  has  been  called  the 
“check  mark”  pattern.  We  predict  this  pattern  using  a  linear  model  and  use  the  model  to 
estimate  exposure  effects  in  the  epidemiologic  study.  Additionally,  we  show  that  a 
previously  published  study  of  the  check  mark  pattern  suggesting  reverse  causation  is 
incorrect. 


Key  words:  check  mark  pattern,  epidemiologic  studies,  observational  studies 
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1.  INTRODUCTION 


We  consider  medical  studies  of  a  health  measure  (Y)  and  an  independent  variable 
(X)  in  patients  classified  to  index  and  control  groups.  In  clinical  studies  the  index 
subjects  are  generally  “cases”,  who  have  been  diagnosed  as  having  a  disease  or  syndrome 
and  in  epidemiological  studies  of  health  and  exposure  to  some  potentially  harmful 
substance,  the  index  subjects  are  identified  as  having  been  “exposed”  to  the  substance.  In 
the  epidemiolgical  study,  X  is  a  biomarker  for  exposure  and  in  the  clinical  study  X  is  a 
measure  of  disease  or  syndrome  severity.  In  both  types  of  studies  we  assume  that  the 
mean  value  of  X  in  controls  is  less  than  the  mean  value  of  X  in  index  subjects.  Thus,  if 
index  and  control  subjects  are  otherwise  similar  one  would  expect  that  index  subjects 
with  low  values  of  X  should  have  a  similar  Y  mean  as  control  subjects  and,  if  X  is 
positively  correlated  with  Y  in  the  index  group,  one  would  expect  that  index  subjects 
with  large  values  of  X  would  have  a  larger  Y  mean  than  controls.  In  fact,  because  index 
subjects  have  a  larger  mean  value  of  X,  one  would  further  expect  that  the  overall  index  Y 
mean  would  be  greater  than  the  overall  control  Y  mean.  In  this  paper,  we  consider 
studies  where  this  expected  pattern  is  not  observed.  The  observed  pattern  is  one  in  which 
the  index  and  control  means  on  Y  are  nearly  equal  and  X  and  Y  are  positively  correlated 
in  the  index  group.  In  this  situation,  the  Y  mean  among  index  subjects  with  low  values  of 
X  will  be  less  than  the  control  Y  mean  and  the  Y  mean  among  index  subjects  with  high 
values  of  X  will  be  greater  than  the  control  Y  mean.  Two  examples  of  this  “check  mark” 
pattern  are  presented,  modeled  and  interpreted.  The  first  example  arises  in  an 
epidemiological  study  and  the  second  in  a  clinical  study. 
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There  is  disagreement  regarding  the  interpretation  of  the  check  mark  pattern  in 
epidemiological  studies.  A  recent  study  of  the  pattern  concluded  that  the  pattern  is 
suggestive  of  reverse  causation,  in  which  Y  causes  changes  in  X  rather  than  X  causing 
changes  in  Y  (Flanders  et  al  1992).  The  Flanders  paper  has  been  cited  by  the  Institute  of 
Medicine  (1994)  in  their  recent  review  of  Agent  Orange  research.  We  show  that 
Flanders’  arguments  are  incorrect  and  give  alternative  interpretations  for  the  pattern. 

The  purpose  of  this  paper  is  to  describe  the  check  mark  pattern,  both  empirically 
(in  Section  2)  and  with  statistical  models  (in  Section  3),  to  use  the  models  to  estimate 
exposure  effects  in  the  epidemiological  example,  to  explore  reverse  order  causation  in  the 
context  of  our  models  and  to  indicate  an  error  in  the  reverse  order  causation  argument  of 
Flanders  et  al  (1992).  We  do  not  intend  to  provide  a  statistical  analysis  of  these  data,  as 
they  have  already  been  analyzed  elsewhere  in  rigorous  detail. 
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2.  EXAMPLES 


We  present  two  examples  of  the  check  mark  pattern  in  medical  studies. 

Example  2.1 

The  Air  Force  Health  Study  (AFHS)  is  a  20-year  prospective  epidemiological 
study  of  possible  health  effects  from  exposure  to  herbicides  and  their  contaminant, 
2,3,7,8-tetrachlorodibenzo-p-dioxin  (dioxin)  in  veterans  of  Operation  Ranch  Hand,  the 
unit  responsible  for  spraying  Agent  Orange  and  other  herbicides  in  Vietnam  from  1962  to 
1971  (Wolfe  et  al  1990).  A  group  of  Air  Force  veterans  who  served  in  Southeast  Asia 
during  the  same  period  but  who  were  not  involved  with  spraying  herbicides  serve  as 
controls.  Control  veterans  were  matched  to  Ranch  Hand  veterans  on  age,  military 
occupation  and  race.  The  study  compares  the  health,  reproductive  outcomes  and 
mortality  of  Ranch  Hand  veterans  with  control  veterans.  Physical  examinations  were 
administered  in  1982,  1985,  1987  and  1992.  Examinations  are  planned  for  1997  and 
2002.  Since  1987,  a  measurement  of  dioxin  in  serum  has  been  used  as  the  index  of 
exposure  (Wolfe  et  al  1992). 

Nine  hundred  fifty  two  Ranch  Hands  and  1,281  controls  attended  the  1992  AFHS 
physical  examination  of  whom  894  Ranch  Hands  and  1,084  controls  had  quantitatable 
dioxin  results  (Grubbs  et  al  1995).  An  investigation  of  endocrine  function 
included  serum  insulin,  measured  in  mlU/ml.  Because  we  were  interested  in  the 
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association  between  dioxin  (X)  and  insulin  (Y)  in  nondiabetic  subjects,  we  excluded 
known  diabetics  and  subjects  with  2  hour  post-prandial  glucose  levels  greater  than  200 
mg/ml  (125  controls  and  115  Ranch  Hands),  leaving  959  controls  and  779  Ranch  Hands 
with  insulin  and  dioxin  levels  for  consideration. 

We  studied  the  relation  between  insulin  and  dioxin  after  both  had  been 
logarithmically  transformed  (the  number  1  was  added  to  the  dioxin  result  before  taking 
the  logarithm)  using  linear  regression  and  t-tests.  The  data  are  shown  in  Figure  1,  with  a 
least-squares  line  overlaid  on  the  Ranch  Hand  data.  The  coefficient  of  log(dioxin+l)  in 
the  regression  of  log(insulin)  in  Ranch  Hands  is  statistically  significant  (slope=0.18, 
standard  erron=0.031). 
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Figure  1 :  Log(lnsulin)  vs  Log(Dioxin+1 )  by  Group 
in  the  1992  Air  Force  Health  Study 

Controls  (N=960)  Ranch  Hand  (N=779) 


Log(Dioxin+1) 


Log(Dioxin+1 ) 


Table  1  gives  summary  statistics  by  group. 


Table  1:  Log(Dioxin+l)  and  Log(Insulin)  by  Group 


Log(dioxin+l) 

Log(insulin) 

Group 

N 

Mean  (Std  Dev) 

Mean  (Std  Dev) 

Ranch  Hand 

779 

2.7  (0.99) 

4.24  (0.88) 

Control 

959 

1.8(0.44) 

4.25  (0.83) 

By  design,  the  mean  of  log(dioxin+l)  in  Ranch  Hands  (2.7)  is  greater  than  the  mean  of 
log(dioxin+l)  in  controls  (1.8).  The  group  log(insulin)  means  were  not  significantly 
different  [mean  difference=-0.01,  95%  Cl:  (-0.07,  0.09)]. 
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We  stratified  the  data  to  four  categories  according  to  group  and  dioxin  level.  The 


first  category  was  comprised  of  control  subjects  with  dioxin  less  than  10  ppt,  the  value 
we  regard  as  the  threshold  for  background  exposure.  The  remaining  three  categories, 
named  “Low”,  “Medium”  and  “High”  were  determined  by  the  cut  points  10  ppt  and  24 
ppt  in  the  dioxin  distribution  in  the  Ranch  Hand  (RH)  group.  Table  2  gives  summary 
statistics  for  log(insulin)  and  log(dioxin+l)  in  each  of  these  four  dioxin  categories. 


Table  2:  Log(Dioxin+l)  and  Log(Insulin)  by  Dioxin  Category 


Dioxin 

Category 

N 

Log(dioxin+1) 

Mean  (Std  Dev) 

Log(insulin) 

Mean  (Std  Dev) 

Control 

942 

1.5  (0.55) 

4.2  (0.83) 

RH  Low 

341 

1.8  (0.44) 

4.1  (0.84) 

RH  Medium 

221 

2.8  (0.24) 

4.3  (0.87) 

RH  High 

217 

4.0  (0.59) 

4.4  (0.90) 
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The  data  are  summarized  in  Figure  2  by  dioxin  category,  showing  the  check  mark  pattern. 


Figure  2:  Log(lnsulin)  by  Dioxin  Category 
in  the  Air  Force  Health  Study 


Controls  Low  Medium  High 

Dioxin  Category 


LEGEND:  Low,  Medium  and  High  are  Dioxin  categories  in  Ranch  Hand 
subjects.  Whiskers  extend  to  the  5th  and  95th  percentiles. 

Dots  are  lower  and  upper  5  percent.  Rectangles  are  determined 
bv  auartiles.  and  the  mean  is  indicated  with  a  +  . 


Example  2.2 

A  recent  study  of  excitatory  amino  acids  in  cerebral  spinal  fluid  (CSF)  and 
severity  of  pain  or  tenderness  in  patients  with  fibromyalgia  syndrome  (FS)  included  41 
patients  diagnosed  as  having  FS  and  37  age,  sex  and  ethnicity  matched  healthy  normal 
controls;  we  call  this  study  the  CSF  excitatory  amino  acid  (CSF-EAA)  study. 

In  the  CSF-EAA  study  CSF  samples  were  obtained  from  each  individual  by 
standard  lumbar  puncture.  Taurine  (Y)  was  measured  in  CSF  as  previously  described 
using  specific  radioimmunoassays  (Giovengo  et  al  1995)  and  was  reported  in  ng/pl.  The 
severity  of  tenderness  (X)  was  measured  by  the  tender  point  index  (TPI),  which  involved 
digital  palpation  at  18  specific  soft  tissue  sites  (Wolfe  et  al  1990).  In  addition  to 
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inquiring  about  pain  induced  by  palpation,  the  physician  closely  observed  the  patient  for 
any  physical  response  and  scored  each  site  as  follows:  0  (no  pain),  1  (tenderness  reported 
but  without  physical  response),  2  (“semi -objective”  tenderness  demonstrated  by  a 
physical  response  such  as  a  wince  or  withdrawal),  3  (very  exaggerated  physical  response) 
and  4  (untouchable;  patient  will  not  allow  palpation  at  a  given  site  fearing  unbearable 
pain).  The  diagnosis  of  FS  was  based  on  the  continuous  presence  of  musculoskeletal 
pain  for  3  months  and  “semi-objective”  tenderness  at  1 1  or  more  of  the  18  tender  points 
(Wolfe  et  al  1990).  The  data  are  shown  in  Figure  3,  with  a  least-squares  line  overlaid  on 
the  FS  data.  The  coefficient  of  TPI  in  the  regression  of  taurine  in  FS  patients  is 
statistically  significant  (slope=0.12,  standard  error=0.031). 

jFigure  3:  Taurine  in  CSF  and  the  Tender  Point  Index 

I  in  the  Excitatory  Amino  Acid  Study 


15 

10 

0) 

c 

3 

TO 

\~ 

5 

0 


Controls  (N=37) 


Fibromyalgia  (N=41) 


Tender  Point  Index 


Tender  Point  Index 
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Table  3  gives  summary  statistics  by  group. 


Table  3:  Tender  Point  Index  and  Taurine  by  Group 


Tender  Point  Index 

Taurine 

Group 

N 

Mean  (Std  Dev) 

Mean  (Std  Dev) 

Fibromyalgia  Syndrome 

41 

34.9(11.89) 

6.6  (2.75) 

Control 

37 

0.4  (0.98) 

5.9  (2.61) 

By  design,  the  FS  mean  on  TPI  (34.9)  is  greater  than  the  control  mean  (0.4).  The  mean 
value  of  taurine  did  not  vary  significantly  with  group  [mean  difference^.  13  ng/p.1,  95% 
Cl:  (-0.06  ng/pl ,  2.32  ng/pl)]. 

We  stratified  the  data  to  four  categories  according  to  group  and  TPI  level.  The 
first  category  was  comprised  of  control  subjects.  The  remaining  three  categories,  named 
“Low”,  “Medium”  and  “High”  were  determined  by  the  textiles  (3 1  and  40)  of  the  TPI 
distribution  in  FS  patients.  Table  4  gives  summary  statistics  for  taurine  and  the  TPI  in 
each  of  these  four  TPI  categories. 
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Table  4:  The  Tender  Point  Index  and  Taurine  by  TPI  Category 


TPI 

Category 

N 

Tender  Point  Index 

Mean  (Std  Dev) 

Taurine  (ng/^il) 

Mean  (Std  Dev) 

Control 

37 

0.35  (0.98) 

5.88  (2.62) 

FS  Low 

15 

22.3  (5.62) 

5.29  (2.26) 

FS  Medium 

14 

36.1  (2.43) 

6.58  (1.80) 

FS  High 

12 

49.4  (4.21) 

8.16(3.49) 

The  data  are  summarized  in  Figure  4  by  TPI  category,  showing  the  check  mark 


pattern. 


Figure  4:  Taurine  by  TPI  Category 
in  the  Excitatory  Amino  Acid  Study 


15 

12 


v  9 

*c 

to 

I-  6 


3 

0 


Controls  Low  Medium  High 

TPI  Category 


LEGEND:  Low,  Medium  and  High  are  TPI  categories  in  FS  patients. 
Whiskers  extend  to  the  5th  and  95th  percentiles. 

Dots  are  lower  and  upper  5  percent.  Rectangles  are  determined 
bv  ouartiles  and  the  mean  is  indicated  with  a  +. 
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3.  MODELING  THE  CHECK  MARK  PATTERN 
3.1  Definition  of  the  Check  mark  Pattern 

The  patterns  in  each  of  these  two  examples  are  caused  by  nearly  equal  group 
means  on  Y  and  a  positive  correlation  between  X  and  Y  in  the  index  group.  The 
conditional  means  shown  in  Tables  2  and  4  then  exhibit  the  pattern,  with  the  control 
mean  on  the  health  variable  (Y)  being  greater  than  the  index  mean  in  the  Low  category 
and  an  increasing  sequence  of  conditional  means  on  Y  in  the  index  categories  (Low, 
Medium  and  High)  determined  by  two  cut  points  (cu  and  C21)  in  the  distribution  of  the 
independent  variable  (X)  in  the  index  group,  where  cn  and  C12  are  approximate  tertiles  of 
X  in  the  index  group.  The  control  category  is  comprised  of  control  subjects,  possibly 
truncated  with  X  at  or  below  a  cut  point  cio,  which  we  assume  as  a  high  quantile  of  the 
distribution  of  X  in  the  control  group.  To  formalize  this  phenomenon,  we  introduce 
notation  for  the  conditional  expectations  in  Table  5,  where  group  is  indicated  by  j 
(control:j=0,  Index:j=l). 


Table  5:  Category  Definitions  and  Conditional  Expectations 


Category 

■ 

Interval 

Label 

E(Y|j,XeAk) 

Control 

0 

(-<»,  Cio) 

Ao 

PyO 

Index  Low 

1 

(-°0»  Cn) 

A, 

Pyl 

Index  Medium 

1 

(cn,C2i) 

a2 

Py2 

Index  High 

1 

(C21,00) 

a3 

Py3 

13 


The  check  mark  pattern  is  said  to  hold  if  (j.yo>|a.yi  and  (J.yi<(J.y2<M-y3-  In  the  AFHS, 
c i o=  1 0  ppt,  Ci  i=10  ppt  and  C2i=24  ppt  and  in  the  CSF-EAA  study  Cio=co,  Cn=31  and 
c2i=40. 


3.2  Statistical  Models  to  Predict  the  Check  mark  Pattern 

We  propose  standard  linear  statistical  models  to  predict  the  observed  patterns  in 
Tables  2  and  4.  Let  5jj  =1  if  subject  i  is  in  group  j,  0  otherwise,  j=0,l,  i=l,  2, nj,  and 
consider  the  following  linear  model  for  subject  i  in  group  j, 

Yji  =  P0  +  /^(Xoi-/Lo)<5oi  +  (Xii-/Ai)£ii]  +  £ji  >  (1) 

where,  for  subject  i  in  group  j,  j=0,l,  i=l,2, nj,  Yji  and  Xjj  are  the  observed  values  of  Y 
and  X,  pxj  is  the  population  mean  of  X  from  group  j  and  Sji  is  a  random  error  independent 
of  Xjj  and  bjj  with  mean  0  and  variance  cr2.  Under  model  (1),  E(Yoj)=E(Y)j)=po,  that  is, 
the  overall  means  on  Y  in  the  two  groups  are  equal.  Furthermore,  if  Pi>0,  it  is  easy  to 
verify  that  model  (1)  predicts  the  check  mark  pattern.  Using  model  (1),  we  have 

Fyo=Po+P  i  [E(Xoi|X0j<ci  o  )-E(Xoi)] .  (2) 

But,  since  cio  is  a  high  quantile  of  the  control  distribution  of  X,  the  coefficient  of  Pi  in 
(2)  is  approximately  zero.  However, 

py,=Po+P,[E(X,j|X,j<c,i  )-E(X,j)]  (3) 

and  the  coefficient  of  pi  in  (3)  is  negative  because  Cn  is  an  approximate  lower  tertile  of 
the  distribution  of  X  in  the  index  group.  Thus,  pyo>pyi.  The  remaining  conditional 
means  follow  the  ordering  pyi<py2<py3  because  Pi>0. 
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In  the  special  case  that  all  of  the  controls  have  X=0  and  the  index  subjects  have 
positive  values  of  X,  as  in  the  CSF-EAA  study,  we  reduce  model  (1)  to 

Yji  =  /?0  +  A[(Xi.-^,)^i]  +  £ji.  (4) 

Once  again,  it  is  easy  to  verify  that  the  reduced  model  (4)  predicts  equal  means  and  the 
check  mark  pattern  when  (3i>0. 

Model  (1)  implies  that  p0  is  the  population  mean  of  group  1  as  well  as  of  group  0. 
Therefore,  p0  is  estimated  by  y0 ,  Yj  or  Y  =  (no  Yo  +  m  Yj)  /  (no  +  m).  Assuming  a 
common  variance  in  the  two  groups,  it  is  better  to  estimate  Po  with  Y  because  its  variance 
is  smaller  than  that  of  Yo  or  Yi  •  Therefore,  to  accommodate  this  in  our  models,  so  that 
standard  statistical  software  packages  can  be  used,  at  the  time  of  fitting  the  models,  we 
replace  the  means  pXJ  by  their  estimators  Xj  >  j— 0,1 .  Based  on  (1),  the  conditional 

log(insulin)  means  are  =  4.24,  £yl  =  4.08,  //y2  =  4.26  and  //y3  =  4.49 ,  which  are  very 

comparable  to  the  observed  means  in  Table  2.  The  predicted  conditional  means  using 
model  (4)  for  taurine,  are  =  6.25,  //yl  =  4.68,  /uy2  =  6.37  and  jii  =  8.03, 

corresponding  to  the  observed  means  in  Table  4.  Thus,  model  (1)  and  its  reduction  (4) 
describe  the  check  mark  patterns  given  in  the  two  examples. 
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3.3  Reverse  Causation 


At  this  point  we  should  mention  that  Flanders  et  al  (1992)  conducted  a  study  of 
the  check  mark  pattern  using  AFHS  data.  They  studied  two  models;  their  first  model 
considered  a  relationship  between  health  and  dioxin  with  health  as  the  dependent  variable 
and  dioxin  as  the  independent  variable  and  their  second  model  reversed  the  roles  of  these 
two  variables.  When  analyzing  their  second  model,  they  made  the  assumption  that 
E(Y[j=0)=E(Y[j=l),  i.e.  the  Ranch  Hand  and  control  health  means  are  equal,  while  not 
making  this  assumption  for  their  first  model.  As  we  show  in  the  Appendix,  under  this 
assumption  the  coefficient  of  health  in  their  second  model  is  identically  zero  (which  was 
not  recognized  by  Flanders  et  al),  making  their  model  not  very  useful.  This  mathematical 
oversight  caused  them  to  believe  that  their  second  “reverse  causation”  model  was  the 
appropriate  model  and  hence  concluded  that  the  check  mark  pattern  suggested  reverse 
causation.  Therefore,  the  mathematical  arguments  used  by  Flanders  et  al  to  argue  that  the 
check  mark  pattern  suggests  reverse  causation  are  incorrect. 

To  explore  the  possibility  of  reverse  causation  in  the  context  of  our  model  (1), 
suppose  that  reverse  causation  actually  holds  and  consider  a  revision  of  model  (1)  with 
dioxin  as  the  dependent  variable  and  health  as  the  independent  variable,  given  by 

Yji  =  /?0  +  A  [(X;,-  -  Mj  <?oi  +  (Xu  -  A*i)  <?h]  +  £ji ,  (5) 

where  Yji  is  the  dioxin  level  and  xji  is  the  health  variable  of  subject  i  in  group  j,  and 

/A  is  the  health  variable  mean  in  group  j,  j=0,l .  Now  assume,  that  the  health  variable 

means  in  the  two  groups  are  equal,  E(X*[j=0)=E(X*[j=l).  Then,  using  model  (5)  we  find 
that  E(Yoi)  =  E(Yh)  ,  implying  that  the  mean  dioxin  levels  in  the  two  group  are  the  same, 
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which  violates  the  known  difference  between  the  Ranch  Hand  and  control  dioxin 
distributions.  Note,  however,  that  this  happens  regardless  of  the  assumption  of  equal 
group  means.  Hence,  one  cannot  conclude  reverse  causation  by  a  simple  reversal  of  the 
roles  of  X  and  Y  in  model  (1). 

Now,  one  may  argue  that  although  model  (5)  does  not  predict  the  check  mark 
pattern  and  so  does  not  support  reverse  causation,  that  there  might  be  another  model  that 
describes  the  relationship  between  Y*  and  X*  and  also  predicts  the  check  mark  pattern. 
Hence,  we  consider  a  more  general  model 

Yji  =  h(0,Xji)  +  £ji ,  (6) 

where  h(0,Xji)  is  some  function  of  a  vector  of  parameters  0  and  Xji  and  Sjj  has  mean  0 
and  variance  a2.  Here,  we  are  working  under  the  assumption  that  Yji  (dioxin)  is  the 
dependent  variable  and  that  Xji  (health)  is  the  independent  variable.  Now,  following 
Flanders  argument,  to  produce  the  check  mark  pattern  we  need  conditional  expectations 
of  (6)  (given  that  dioxin  is  contained  in  an  interval),  such  as 

E(  Yoi  |  Yoi  <  clo)  =  E[h(0,  X’od  +  Soi  |  Yoi  <  do]  •  (7) 

However,  the  expectation  on  the  right  hand  side  of  (7)  violates  the  traditional  definition 
of  regression,  as  it  involves  the  conditional  expectation  of  some  function  of  the 
independent  variable  conditional  on  the  dependent  variable.  Therefore,  the  definition  of 
our  dependent  and  independent  variables  is  in  jeopardy.  Hence,  this  model  violates  the 
traditional 
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approach  to  regression  unless  we  reverse  the  roles  of  the  variables,  in  which  case  model 
(1)  appears  to  be  appropriate.  Therefore,  we  consider  the  statistical  utility  of  model  (1)  in 
the  Air  Force  Health  Study. 


3.4  Estimating  Exposure  Effects 

Because  the  concept  of  “exposure”  is  central  to  the  AFHS,  one  may  define  a 
subject  as  exposed  (e=l )  if  his  dioxin  level  is  above  Cio  and  unexposed  (e=0)  otherwise. 
Using  model  (1),  the  mean  of  Y  among  the  exposed,  /ryE ,  can  be  written  as 


Ays  =  E(Yji|e  =  1,  j  =  0)P(j  =0|e  =  1)  +  E(Yj,|e  =  1,  j  =  l)P{j  —  1  |e  =  1) 
=  {P„  +AtE(X„i|Xoi>c,o)-ft4  P(C ~  =  0> 

+  {A  +A[E(x,i|x„>c,o)-A„]}  P-~- p^r-^0—  • 

Similarly,  the  mean  of  Y  among  the  unexposed  is 

My-z  =  E(Yji|e  =  0,  j  =  0)PG  =  0|e  =  0)  +  E(Y,\e  =  0,  j  =  l)P(j  =  l|e  =  0) 
=  {A  +A[E(Xo,IXoi<c,„)-^4  P(e  =  P(  °=°oP(i  =  0> 

+{A  +A[E(X„|X„<C,.)-//.,)}  P(e  =  ^]e°‘^(i  =  1). 


However,  the  probabilities  P(j:=0)  and  P(j =1 )  are  generally  not  estimable.  Hence  we 
consider  the  estimable  difference  of  the  Y  mean  of  exposed  index  subjects  and  the  Y 
mean  of  unexposed  controls,  given  by  D  =  /ryE1  -  ^yEo ,  where  fiyE]  =  E(Y|  e  =  1,  j  =  1)  and 

Ayio =  E(Y|  e  =  0,  j  =  0)  which,  under  model  (1),  is  given  by 
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D  /?,  [Pei  (AxEI  "  Ax El)  +  PeO  (AxEO  "  AxEo)]  > 

where  pEk=P(e=  1  |j=k),  pEk  =  P(e  =  0|j  =  k),  /ixEk  =  E(X|  e  =  l,j  =  k)  and 
//xEk  =  E(X|  e  =  0,  j  =  k) ,  k=0, 1 .  Hence  D  is  estimated  by 

D  =  /?,  [pa  (Xe,  -  Xe.)  +  Peo  (Xeo  -  Xio)] . 

where  Xek  an<l  Xik are  the  sample  means  of  exposed  and  unexposed  subjects  in  group  k, 
k=0,l,  pE1  is  the  sample  proportion  of  exposed  subjects  in  group  1  and  pE0  is  the  sample 
proportion  of  unexposed  subjects  in  group  0.  The  estimate  of  the  conditional  standard 

/V 

deviation  of  D ,  given  X,  is 

3d  ~  cyp  [Pei(Xei'Xei)  +  Peo(Xeo~Xeo)]  • 

Hence,  to  test  the  hypothesis  Ho:D=0,  we  use  the  statistic  T-px  /  crp  ,  distributed  as  t 

with  n-2  degrees  of  freedom  under  H0.  In  the  special  case  that  all  of  the  index  subjects 
are  exposed  and  all  of  the  controls  are  unexposed,  D  reduces  to  the  difference  of  group 
means  E(Yy=l)-E(Ylj=0)  and  the  appropriate  test  statistic  is  the  ordinary  two  sample  t- 
test. 

For  example,  to  estimate  D  in  the  Air  Force  Health  Study  data  on  log(insulin)  and 
dioxin,  we  have  ^  =  0.18 ,  =  0.031 ,  pE1  =  0.44  ,pE0  =  0.02 ,  Xei  =  3.384, 

Xii  =  1.836,  Xeo  =  2.705  and  Xeo  =  1-501.  Therefore,  D  =  0.13  and  crb  =  0.022 . 

Hence  a  95%  confidence  interval  for  D  is  0.09  to  0.17  and  the  test  statistic  for  H0  is 
T=5.81  (p<0.001). 
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4.  DISCUSSION 


There  is  currently  no  convincing  explanation  for  the  check  mark  pattern  in  the  Air 
Force  study.  The  relationship  between  dioxin  and  insulin  could  reflect  direct  causation, 
bias,  reverse  causation,  or  the  effects  of  differential  dioxin  elimination.  The  association 
is  consistent  with  direct  causation  through  dioxin  blocking  insulin  receptors.  This 
hypothesis  would  explain  the  trend  of  increasing  insulin  with  increasing  dioxin  in 
nondiabetic  Ranch  Hands  and  a  decrease  in  mean  insulin  with  increased  dioxin  in 
diabetic  Ranch  Hands  (data  not  shown;  see  Grubbs  et  al  1995).  However,  this 
explanation  is  not  adequate  to  also  explain  the  near  equality  of  the  insulin  means  in 
Ranch  Hands  and  controls.  The  near  equality  of  observed  group  insulin  means  follows 
from  standard  bias  models  (Anderson  et  al  1980).  If  the  cut  point  we  regard  as  the 
threshold  for  exposure  (10  ppt)  truly  separates  exposed  from  unexposed,  then  44%  of  the 
Ranch  Hand  group  is  unexposed,  causing  the  observed  group  mean  difference  to  be 
biased  toward  equality.  This,  together  with  a  positive  correlation  between  dioxin  and 
insulin  in  Ranch  Hand  veterans,  would  produce  the  check  mark  pattern. 

Reverse  causation  could  occur  through  contaminated  medications.  It  is  possible 
that  medications  are  contaminated  with  trace  amounts  of  dioxin,  as  are  many  foods.  If  so, 
repeated  doses  of  medications  could,  over  a  period  of  years,  increase  the  dioxin  body 
burden  of  subjects  with  disease.  However,  this  possibility  cannot  explain  the  association 
between  dioxin  and  log(insulin)  because  we  restricted  the  analysis  to  nondiabetic  subjects 
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who  are  not  taking  insulin-controlling  medications.  Nevertheless,  there  might  be  other 
sources  of  dioxin  uptake  which  correlate  with  insulin  that  could  account  for  the  pattern. 

The  known  association  between  dioxin  half-life  and  percent  body  fat,  a  correlate 
of  insulin,  might  also  contribute  to  the  association  between  insulin  and  dioxin.  Obese 
persons  have  a  longer  dioxin  half-life  than  lean  persons  (Michalek  et  al,  1995).  Thus, 
veterans  who  are  overweight  are  more  likely  to  have  elevated  insulin  levels,  but  are  also 
more  likely  to  retain  their  dioxin  longer  than  lean  veterans.  However,  the  association 
between  dioxin  and  log(insulin)  remains  significant  even  after  adjustment  for  percent 
body  fat. 

Interpretation  of  the  check  mark  pattern  in  the  CSF-EAA  study  requires 
consideration  of  the  mechanism  of  pain  transmission  (referred  to  as  nociception). 
Nociception  involves  a  complex  series  of  electrochemical,  receptor  ligand  interaction, 
and  second  messenger  processes  which  sequentially  occur  in  peripheral  nerves,  the  spinal 
cord,  and  the  brain.  This  process  is  further  complicated  because  several  steps  are  subject 
to  down-regulation  or  “sensory  interpretation”  by  inhibitory  (antinociceptive) 
neurochemical,  receptor  ligand,  and  second  messenger  effects. 

A  very  abbreviated  view  of  the  nociceptive  process  is  that  a  painful  peripheral 
stimulus  initiates  the  electrochemical  depolarization  of  a  primary  efferent  neuron  which 
releases  substance  P  and  excitatory  amino  acids  (EAA)  into  the  dorsal  horn  of  the  spinal 
cord  (Malmberg  and  Yaksh  1992).  Substance  P  facilitates  the  nociceptive  activity  of  the 
EAA  which  induce  the  release  of  prostaglandins  and,  perhaps,  nitric  oxide  from  arginine. 
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The  prostaglandin  messenger  then  causes  depolarization  of  the  ascending  spinal  cord 
neuron  which  carries  the  nociceptive  message  to  the  brain. 

Antinociception  is  accomplished  though  inhibitory  contributions  from  a  peptide 
fragment  resulting  from  protease  digestion  of  substance  P  (substance  Pi-?),  serotonin, 
taurine,  and  endogenous  opioids  (Homfeldt  et  al  1 992,  Smullin  et  al  1 990,  Skilling  et  al 
1990).  When  the  spinal  cord  of  rats  was  exposed  to  intact  substance  P,  it  caused  the 
release  of  taurine,  but  substance  Pi -7  seemed  to  inhibit  that  release. 

The  actual  role  of  taurine  in  nociception  and  antinociception  is  not  yet  clear.  The 
tissue  concentration  of  taurine  decreases  in  the  hypothalamus  and  lower  brain-stem  nuclei 
of  rats  experiencing  acute  pain  (Palkovitz  et  al  1986).  It  should  be  noted,  however,  that 
the  brain  concentration  of  a  given  neurochemical  may  vary  inversely  with  its 
concentration  in  the  spinal  cord  (Sharma  et  al  1990).  Intratheceally-injected  taurine 
appeared  to  induce  writhing  pain  in  animals  (Larson  1989).  Conversely,  taurine  inhibited 
the  pain-like  responses  of  rats  subjected  to  intrathecal  injection  of  substance  P  or  to  the 
acetic  acid-induced  writhing  test  (Smullin  et  al  1990),  and  both  of  these  effects  could  be 
blocked  by  administration  of  the  taurine  antagonist  6-aminoethyl-3-methyl-4H-l,2,4- 
bendo-thiadiazine-1 ,1 -dioxide  (TAG). 

The  correlation  of  taurine  with  the  TPI  measure  of  pain  herein  described  as  an 
example  of  the  check  mark  pattern  may  simply  represent  the  association  of  taurine  with 
the  nociception/antinociception  process.  In  other  words,  the  pattern  might  result  from  a 
combination  of  direct  and  reverse  causation,  with  pain  causing  changes  in  taurine  levels 
and  taurine  levels  causing  changes  in  pain.  The  FS  patients  were  experiencing  a  range  of 
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painful  sensations  prior  to  the  TPI  examination,  while  the  normal  controls  were  not 
experiencing  pain  de  novo  and  the  examination  pressure  was  not  perceived  by  them  to  be 

painful.  It  is  possible  that  a  number  of  other  chemical  mediators  of  nociceptive  and 
antinociceptive  processes  would  also  exhibit  a  similar  pattern  if  they  are  not  too  rapidly 
degraded  in  the  course  of  normal  homeostatic  regulation. 

We  conclude  that  the  check  mark  pattern  might  be  caused  by  direct  or  reverse 
causation,  misclassification  bias,  or  differential  dioxin  elimination  in  the  Air  Force  study 
and  that  the  pattern  may  arise  from  a  combination  of  direct  and  reverse  causation  in  the 
CSF-EAA  study.  Thus,  the  reverse  causation  explanation  for  the  checkmark  pattern 
appears  plausible,  even  though  a  previously  published  study  of  reverse  causation  is 
mathematically  incorrrect.  Finally,  we  have  demonstrated  that  a  simple  linear  model 
describes  the  pattern  and  that  the  model  can  be  used  to  estimate  exposure  effects  in  the 
Air  Force  study. 
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6.  APPENDIX 


In  our  notation,  Flanders  et  al  Model  2  for  subject  i,  i=l,2, n,  is  given  by 

X2i=a+bX  i  j+cY  i+Sj, 

where  X2  is  measured  dioxin,  X\  is  true  dioxin,  Y  is  the  health  variable  and  s  is  error. 
The  four  variables,  Xi,  X2,  Y  and  s  are  assumed  jointly  normally  distributed  with 
E(X,)=m,  VAR(X,)=  a] ,  E(Y)=|xy,  VAR(Y)=<72 ,  E(s3)=0  and  VAR(e)=o-2  and  Xlf  Y 

and  s  are  assumed  mutually  independent.  Thus  health  (Y)  and  measured  dioxin  X2  are 
bivariate  normal  with  mean  vector  (juy ,  a  +  b  //,  +  c  piy) ,  VAR(X2)=  c2  cr2  +  b2  cr2  +  al , 
and  COV(Y,X2)=  ccr2 .  Therefore  E(Y|X2=x2)=y  o+YiX2,  where 

fc<7y(a  +  b//,  +  CyW  )] 

y 0  ~  Ay  j  2  2  ,  ,  2  FT  2  [ 

[  C  CTy  +  b  (Tl  +  tjc  J 

and 


1 1  2  2,1 2  2  , _ 2  ' 

c  cry  +  b  at  +ae 

But  Flanders  et  al  also  assume  E(Y[j=0)=E(Y[j=l),  where  j=0  for  control  subjects  and  j=l 
for  index  subjects,  which  implies 

y0+y  1  E(X2  [j =o)=y  0+y  1  E(x2  (j = 1 ) . 
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But  E(X2[j=l)>E(X2[j=0).  Hence,  yi  =0  and,  therefore,  c=0.  Thus,  Model  2  and  their 
assumption  that  E(Y|j=l)=E(Y[j=0)  imply  that  Y  and  X2  are  unrelated.  As  a  result,  k=0 
in  their  equation  (8)  and  so  their  conclusions  regarding  the  check  mark  pattern  and 
reverse  causation  are  not  correct. 
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